Dataset statistics
| Number of variables | 22 |
|---|---|
| Number of observations | 4399 |
| Missing cells | 6887 |
| Missing cells (%) | 7.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 756.2 KiB |
| Average record size in memory | 176.0 B |
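The derived figures in this table can be reproduced from the raw counts. The sketch below assumes that the missing-cell percentage is taken over all cells (variables × observations) and that the average record size is the total memory divided by the number of observations. The variable names are illustrative and are not part of pandas-profiling.

```python
# Raw figures copied from the "Dataset statistics" table above.
n_variables = 22
n_observations = 4399
missing_cells = 6887
total_size_kib = 756.2

# Assumption: the percentage is computed over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 KiB = 1024 bytes, averaged over all rows.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions the results round to the 7.1% and 176.0 B shown in the table.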
Variable types
| Numeric | 18 |
|---|---|
| Categorical | 3 |
| DateTime | 1 |
Alerts
| Alert | Type |
|---|---|
| grade is highly correlated with sqft_basement and 5 other fields | High correlation |
| sqft_basement is highly correlated with grade and 5 other fields | High correlation |
| bathrooms is highly correlated with grade and 5 other fields | High correlation |
| bedrooms is highly correlated with sqft_basement and 4 other fields | High correlation |
| sqft_above is highly correlated with grade and 7 other fields | High correlation |
| sqft_living15 is highly correlated with grade and 3 other fields | High correlation |
| floors is highly correlated with bedrooms and 3 other fields | High correlation |
| yr_renovated is highly correlated with jhygtf | High correlation |
| yr_built is highly correlated with zipcode and 3 other fields | High correlation |
| jhygtf is highly correlated with yr_renovated | High correlation |
| sqft_lot is highly correlated with sqft_lot15 | High correlation |
| price is highly correlated with bathrooms and 2 other fields | High correlation |
| sqft_lot15 is highly correlated with sqft_lot | High correlation |
| sqft_living is highly correlated with grade and 7 other fields | High correlation |
| view is highly correlated with waterfront | High correlation |
| waterfront is highly correlated with view | High correlation |
| zipcode is highly correlated with yr_built | High correlation |
| condition is highly correlated with yr_built | High correlation |
| sqft_basement has 2677 (60.9%) missing values | Missing |
| yr_renovated has 4187 (95.2%) missing values | Missing |
| df_index has unique values | Unique |
| jhygtf has 4184 (95.1%) zeros | Zeros |
Reproduction
| Analysis started | 2022-09-21 03:00:48.151695 |
|---|---|
| Analysis finished | 2022-09-21 03:02:20.816947 |
| Duration | 1 minute and 32.67 seconds |
| Software version | pandas-profiling v3.3.0 |
| Download configuration | config.json |
df_index
| Distinct | 4399 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18666.95044 |
| Minimum | 14 |
|---|---|
| Maximum | 111906 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.5 KiB |
Quantile statistics
| Minimum | 14 |
|---|---|
| 5-th percentile | 1161.6 |
| Q1 | 6273 |
| median | 14079 |
| Q3 | 26831.5 |
| 95-th percentile | 52858.4 |
| Maximum | 111906 |
| Range | 111892 |
| Interquartile range (IQR) | 20558.5 |
Descriptive statistics
| Standard deviation | 16363.50712 |
|---|---|
| Coefficient of variation (CV) | 0.8766031263 |
| Kurtosis | 1.897401415 |
| Mean | 18666.95044 |
| Median Absolute Deviation (MAD) | 9393 |
| Skewness | 1.353343821 |
| Sum | 82115915 |
| Variance | 267764365.2 |
| Monotonicity | Not monotonic |
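Several of the statistics above are functions of the others, which gives a quick consistency check on the block. The sketch below assumes the usual definitions (range = max − min, IQR = Q3 − Q1, CV = standard deviation / mean, variance = standard deviation squared). It is a check on the reported numbers, not a description of how pandas-profiling computes them internally.

```python
import math

# Values copied from the quantile and descriptive statistics tables above.
minimum, maximum = 14, 111906
q1, q3 = 6273, 26831.5
mean, std = 18666.95044, 16363.50712

value_range = maximum - minimum   # reported Range: 111892
iqr = q3 - q1                     # reported IQR: 20558.5
cv = std / mean                   # reported CV: 0.8766031263
variance = std ** 2               # reported Variance: 267764365.2

# Compare against the reported figures, allowing for rounding in the report.
assert value_range == 111892
assert iqr == 20558.5
assert math.isclose(cv, 0.8766031263, rel_tol=1e-6)
assert math.isclose(variance, 267764365.2, rel_tol=1e-6)
```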
| Value | Count | Frequency (%) |
| 32828 | 1 | < 0.1% |
| 9797 | 1 | < 0.1% |
| 27672 | 1 | < 0.1% |
| 26790 | 1 | < 0.1% |
| 48844 | 1 | < 0.1% |
| 26971 | 1 | < 0.1% |
| 3563 | 1 | < 0.1% |
| 10676 | 1 | < 0.1% |
| 24609 | 1 | < 0.1% |
| 8169 | 1 | < 0.1% |
| Other values (4389) | 4389 |
| Value | Count | Frequency (%) |
| 14 | 1 | |
| 15 | 1 | |
| 18 | 1 | |
| 20 | 1 | |
| 27 | 1 | |
| 29 | 1 | |
| 30 | 1 | |
| 31 | 1 | |
| 41 | 1 | |
| 42 | 1 |
| Value | Count | Frequency (%) |
| 111906 | 1 | |
| 103340 | 1 | |
| 93792 | 1 | |
| 91932 | 1 | |
| 91673 | 1 | |
| 85147 | 1 | |
| 85126 | 1 | |
| 83869 | 1 | |
| 83788 | 1 | |
| 83746 | 1 |
| Distinct | 70 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 4 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 98077.5165 |
| Minimum | 98001 |
|---|---|
| Maximum | 98199 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.5 KiB |
Quantile statistics
| Minimum | 98001 |
|---|---|
| 5-th percentile | 98004 |
| Q1 | 98032 |
| median | 98065 |
| Q3 | 98117 |
| 95-th percentile | 98177 |
| Maximum | 98199 |
| Range | 198 |
| Interquartile range (IQR) | 85 |
Descriptive statistics
| Standard deviation | 53.65570198 |
|---|---|
| Coefficient of variation (CV) | 0.0005470744355 |
| Kurtosis | -0.8578788485 |
| Mean | 98077.5165 |
| Median Absolute Deviation (MAD) | 42 |
| Skewness | 0.4100925325 |
| Sum | 431050685 |
| Variance | 2878.934355 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 98103 | 129 | 2.9% |
| 98117 | 129 | 2.9% |
| 98052 | 127 | 2.9% |
| 98115 | 124 | 2.8% |
| 98023 | 117 | 2.7% |
| 98034 | 112 | 2.5% |
| 98118 | 106 | 2.4% |
| 98133 | 104 | 2.4% |
| 98042 | 104 | 2.4% |
| 98033 | 99 | 2.3% |
| Other values (60) | 3244 |
| Value | Count | Frequency (%) |
| 98001 | 88 | |
| 98002 | 43 | |
| 98003 | 49 | |
| 98004 | 67 | |
| 98005 | 41 | |
| 98006 | 93 | |
| 98007 | 20 | 0.5% |
| 98008 | 63 | |
| 98010 | 14 | 0.3% |
| 98011 | 46 |
| Value | Count | Frequency (%) |
| 98199 | 64 | |
| 98198 | 58 | |
| 98188 | 22 | 0.5% |
| 98178 | 61 | |
| 98177 | 55 | |
| 98168 | 52 | |
| 98166 | 46 | |
| 98155 | 93 | |
| 98148 | 13 | 0.3% |
| 98146 | 50 |
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.655070487 |
| Minimum | 3 |
|---|---|
| Maximum | 13 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.5 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 7 |
| median | 7 |
| Q3 | 8 |
| 95-th percentile | 10 |
| Maximum | 13 |
| Range | 10 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.180323667 |
|---|---|
| Coefficient of variation (CV) | 0.154188478 |
| Kurtosis | 1.078089479 |
| Mean | 7.655070487 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.7560926603 |
| Sum | 33667 |
| Variance | 1.393163959 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 1834 | |
| 8 | 1230 | |
| 9 | 510 | 11.6% |
| 6 | 407 | 9.3% |
| 10 | 256 | 5.8% |
| 11 | 77 | 1.8% |
| 5 | 60 | 1.4% |
| 12 | 16 | 0.4% |
| 4 | 4 | 0.1% |
| 13 | 3 | 0.1% |
| Value | Count | Frequency (%) |
| 3 | 1 | < 0.1% |
| 4 | 4 | 0.1% |
| 5 | 60 | 1.4% |
| 6 | 407 | 9.3% |
| 7 | 1834 | |
| 8 | 1230 | |
| 9 | 510 | 11.6% |
| 10 | 256 | 5.8% |
| 11 | 77 | 1.8% |
| 12 | 16 | 0.4% |
| Value | Count | Frequency (%) |
| 13 | 3 | 0.1% |
| 12 | 16 | 0.4% |
| 11 | 77 | 1.8% |
| 10 | 256 | 5.8% |
| 9 | 510 | 11.6% |
| 8 | 1230 | |
| 7 | 1834 | |
| 6 | 407 | 9.3% |
| 5 | 60 | 1.4% |
| 4 | 4 | 0.1% |
sqft_basement
| Distinct | 202 |
|---|---|
| Distinct (%) | 11.7% |
| Missing | 2677 |
| Missing (%) | 60.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 741.4982578 |
| Minimum | 10 |
|---|---|
| Maximum | 3480 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.5 KiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 190 |
| Q1 | 450 |
| median | 700 |
| Q3 | 970 |
| 95-th percentile | 1459.5 |
| Maximum | 3480 |
| Range | 3470 |
| Interquartile range (IQR) | 520 |
Descriptive statistics
| Standard deviation | 399.378006 |
|---|---|
| Coefficient of variation (CV) | 0.5386095001 |
| Kurtosis | 2.637427297 |
| Mean | 741.4982578 |
| Median Absolute Deviation (MAD) | 260 |
| Skewness | 1.038696699 |
| Sum | 1276860 |
| Variance | 159502.7917 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 700 | 49 | 1.1% |
| 500 | 48 | 1.1% |
| 600 | 47 | 1.1% |
| 400 | 36 | 0.8% |
| 800 | 35 | 0.8% |
| 900 | 33 | 0.8% |
| 300 | 33 | 0.8% |
| 1000 | 31 | 0.7% |
| 620 | 25 | 0.6% |
| 1100 | 22 | 0.5% |
| Other values (192) | 1363 | |
| (Missing) | 2677 |
| Value | Count | Frequency (%) |
| 10 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| 40 | 1 | < 0.1% |
| 60 | 1 | < 0.1% |
| 70 | 3 | 0.1% |
| 80 | 5 | |
| 90 | 2 | < 0.1% |
| 100 | 9 | |
| 110 | 3 | 0.1% |
| 120 | 4 |
| Value | Count | Frequency (%) |
| 3480 | 1 | |
| 3260 | 1 | |
| 2730 | 1 | |
| 2330 | 1 | |
| 2220 | 2 | |
| 2160 | 1 | |
| 2150 | 1 | |
| 2100 | 1 | |
| 2060 | 1 | |
| 2040 | 1 |
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 257.9 KiB |
| Value | Count |
|---|---|
| 0.0 | 3980 |
| 2.0 | 189 |
| 3.0 | 90 |
| 1.0 | 72 |
| 4.0 | 68 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 13197 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 3980 | |
| 2.0 | 189 | 4.3% |
| 3.0 | 90 | 2.0% |
| 1.0 | 72 | 1.6% |
| 4.0 | 68 | 1.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 8379 | |
| . | 4399 | |
| 2 | 189 | 1.4% |
| 3 | 90 | 0.7% |
| 1 | 72 | 0.5% |
| 4 | 68 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8798 | |
| Other Punctuation | 4399 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8379 | |
| 2 | 189 | 2.1% |
| 3 | 90 | 1.0% |
| 1 | 72 | 0.8% |
| 4 | 68 | 0.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4399 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 13197 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 8379 | |
| . | 4399 | |
| 2 | 189 | 1.4% |
| 3 | 90 | 0.7% |
| 1 | 72 | 0.5% |
| 4 | 68 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13197 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 8379 | |
| . | 4399 | |
| 2 | 189 | 1.4% |
| 3 | 90 | 0.7% |
| 1 | 72 | 0.5% |
| 4 | 68 | 0.5% |
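The character-level tables for this categorical variable can be rebuilt from its value counts alone. The sketch below assumes each value is stored as the three-character string shown in the "Common Values" table and uses Python's `unicodedata` module to obtain the Unicode general category. The variable names are illustrative.

```python
from collections import Counter
import unicodedata

# Value counts copied from the "Common Values" table above.
value_counts = {"0.0": 3980, "2.0": 189, "3.0": 90, "1.0": 72, "4.0": 68}

# Count every character, weighted by how often its value occurs.
char_counts = Counter()
for value, count in value_counts.items():
    for ch in value:
        char_counts[ch] += count

total_chars = sum(char_counts.values())

# Group characters by Unicode general category:
# "Nd" is Decimal Number, "Po" is Other Punctuation.
category_counts = Counter()
for ch, count in char_counts.items():
    category_counts[unicodedata.category(ch)] += count

print(char_counts.most_common())
print(total_chars)
print(dict(category_counts))
```

The totals agree with the report: 13197 characters overall, 8379 occurrences of `0`, 4399 of `.`, and a split of 8798 decimal digits to 4399 punctuation characters.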
| Distinct | 25 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.112349329 |
| Minimum | 0.5 |
|---|---|
| Maximum | 8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.5 KiB |
Quantile statistics
| Minimum | 0.5 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1.75 |
| median | 2.25 |
| Q3 | 2.5 |
| 95-th percentile | 3.5 |
| Maximum | 8 |
| Range | 7.5 |
| Interquartile range (IQR) | 0.75 |
Descriptive statistics
| Standard deviation | 0.7714183773 |
|---|---|
| Coefficient of variation (CV) | 0.3651945096 |
| Kurtosis | 1.610740405 |
| Mean | 2.112349329 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | 0.5463799915 |
| Sum | 9288 |
| Variance | 0.5950863129 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2.5 | 1066 | |
| 1 | 798 | |
| 1.75 | 641 | |
| 2.25 | 424 | 9.6% |
| 2 | 392 | 8.9% |
| 1.5 | 268 | 6.1% |
| 2.75 | 249 | 5.7% |
| 3.5 | 159 | 3.6% |
| 3 | 154 | 3.5% |
| 3.25 | 116 | 2.6% |
| Other values (15) | 130 | 3.0% |
| Value | Count | Frequency (%) |
| 0.5 | 1 | < 0.1% |
| 0.75 | 16 | 0.4% |
| 1 | 798 | |
| 1.25 | 4 | 0.1% |
| 1.5 | 268 | 6.1% |
| 1.75 | 641 | |
| 2 | 392 | 8.9% |
| 2.25 | 424 | 9.6% |
| 2.5 | 1066 | |
| 2.75 | 249 | 5.7% |
| Value | Count | Frequency (%) |
| 8 | 1 | < 0.1% |
| 7.5 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 5.75 | 1 | < 0.1% |
| 5.5 | 1 | < 0.1% |
| 5.25 | 4 | 0.1% |
| 5 | 3 | 0.1% |
| 4.75 | 5 | 0.1% |
| 4.5 | 18 | |
| 4.25 | 14 |
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.367810866 |
| Minimum | 1 |
|---|---|
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 9 |
| Range | 8 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.9076104406 |
|---|---|
| Coefficient of variation (CV) | 0.2694956684 |
| Kurtosis | 1.191415866 |
| Mean | 3.367810866 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.40512214 |
| Sum | 14815 |
| Variance | 0.8237567118 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 1945 | |
| 4 | 1426 | |
| 2 | 581 | 13.2% |
| 5 | 340 | 7.7% |
| 1 | 49 | 1.1% |
| 6 | 46 | 1.0% |
| 7 | 9 | 0.2% |
| 9 | 2 | < 0.1% |
| 8 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 49 | 1.1% |
| 2 | 581 | 13.2% |
| 3 | 1945 | |
| 4 | 1426 | |
| 5 | 340 | 7.7% |
| 6 | 46 | 1.0% |
| 7 | 9 | 0.2% |
| 8 | 1 | < 0.1% |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 9 | 2 | < 0.1% |
| 8 | 1 | < 0.1% |
| 7 | 9 | 0.2% |
| 6 | 46 | 1.0% |
| 5 | 340 | 7.7% |
| 4 | 1426 | |
| 3 | 1945 | |
| 2 | 581 | 13.2% |
| 1 | 49 | 1.1% |
| Distinct | 490 |
|---|---|
| Distinct (%) | 11.1% |
| Missing | 4 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1776.286234 |
| Minimum | 420 |
|---|---|
| Maximum | 8570 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.5 KiB |
Quantile statistics
| Minimum | 420 |
|---|---|
| 5-th percentile | 850 |
| Q1 | 1200 |
| median | 1560 |
| Q3 | 2190 |
| 95-th percentile | 3343 |
| Maximum | 8570 |
| Range | 8150 |
| Interquartile range (IQR) | 990 |
Descriptive statistics
| Standard deviation | 807.3587155 |
|---|---|
| Coefficient of variation (CV) | 0.4545206172 |
| Kurtosis | 3.094953254 |
| Mean | 1776.286234 |
| Median Absolute Deviation (MAD) | 440 |
| Skewness | 1.369814349 |
| Sum | 7806778 |
| Variance | 651828.0956 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1220 | 46 | 1.0% |
| 1290 | 44 | 1.0% |
| 1300 | 44 | 1.0% |
| 1250 | 41 | 0.9% |
| 1010 | 41 | 0.9% |
| 1280 | 41 | 0.9% |
| 1120 | 40 | 0.9% |
| 1150 | 39 | 0.9% |
| 1200 | 39 | 0.9% |
| 1340 | 39 | 0.9% |
| Other values (480) | 3981 |
| Value | Count | Frequency (%) |
| 420 | 1 | < 0.1% |
| 470 | 1 | < 0.1% |
| 520 | 1 | < 0.1% |
| 530 | 1 | < 0.1% |
| 540 | 1 | < 0.1% |
| 550 | 1 | < 0.1% |
| 560 | 2 | |
| 570 | 3 | |
| 580 | 1 | < 0.1% |
| 590 | 3 |
| Value | Count | Frequency (%) |
| 8570 | 1 | |
| 7880 | 1 | |
| 6090 | 1 | |
| 5830 | 1 | |
| 5670 | 1 | |
| 5490 | 1 | |
| 5370 | 1 | |
| 5180 | 1 | |
| 5160 | 1 | |
| 5070 | 1 |
| Distinct | 441 |
|---|---|
| Distinct (%) | 10.0% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1988.22965 |
| Minimum | 720 |
|---|---|
| Maximum | 5790 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.5 KiB |
Quantile statistics
| Minimum | 720 |
|---|---|
| 5-th percentile | 1140 |
| Q1 | 1500 |
| median | 1830 |
| Q3 | 2360 |
| 95-th percentile | 3330 |
| Maximum | 5790 |
| Range | 5070 |
| Interquartile range (IQR) | 860 |
Descriptive statistics
| Standard deviation | 680.6366506 |
|---|---|
| Coefficient of variation (CV) | 0.342333015 |
| Kurtosis | 1.445975461 |
| Mean | 1988.22965 |
| Median Absolute Deviation (MAD) | 400 |
| Skewness | 1.090457809 |
| Sum | 8744234 |
| Variance | 463266.2502 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1610 | 44 | 1.0% |
| 1560 | 43 | 1.0% |
| 1640 | 39 | 0.9% |
| 1540 | 39 | 0.9% |
| 1390 | 38 | 0.9% |
| 1720 | 38 | 0.9% |
| 1520 | 37 | 0.8% |
| 1440 | 37 | 0.8% |
| 1580 | 37 | 0.8% |
| 1660 | 37 | 0.8% |
| Other values (431) | 4009 |
| Value | Count | Frequency (%) |
| 720 | 1 | < 0.1% |
| 750 | 1 | < 0.1% |
| 760 | 1 | < 0.1% |
| 780 | 2 | |
| 820 | 2 | |
| 828 | 1 | < 0.1% |
| 830 | 2 | |
| 840 | 3 | |
| 850 | 1 | < 0.1% |
| 860 | 2 |
| Value | Count | Frequency (%) |
| 5790 | 2 | |
| 5340 | 1 | |
| 5110 | 1 | |
| 5070 | 1 | |
| 4920 | 2 | |
| 4760 | 2 | |
| 4700 | 1 | |
| 4670 | 1 | |
| 4630 | 2 | |
| 4590 | 1 |
lat
Real number (ℝ≥0)
| Distinct | 2807 |
|---|---|
| Distinct (%) | 63.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4454.140702 |
| Minimum | 47.1647 |
|---|---|
| Maximum | 47776 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.5 KiB |
Quantile statistics
| Minimum | 47.1647 |
|---|---|
| 5-th percentile | 47.3098 |
| Q1 | 47.48775 |
| median | 47.6057 |
| Q3 | 47.7011 |
| 95-th percentile | 47548.3 |
| Maximum | 47776 |
| Range | 47728.8353 |
| Interquartile range (IQR) | 0.21335 |
Descriptive statistics
| Standard deviation | 13783.62797 |
|---|---|
| Coefficient of variation (CV) | 3.094565011 |
| Kurtosis | 5.892520644 |
| Mean | 4454.140702 |
| Median Absolute Deviation (MAD) | 0.1051 |
| Skewness | 2.80886717 |
| Sum | 19593764.95 |
| Variance | 189988400 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 47.6624 | 6 | 0.1% |
| 47.5396 | 6 | 0.1% |
| 47.6886 | 6 | 0.1% |
| 47681 | 6 | 0.1% |
| 47.5818 | 6 | 0.1% |
| 47.7168 | 5 | 0.1% |
| 47.6934 | 5 | 0.1% |
| 47.6853 | 5 | 0.1% |
| 47.6842 | 5 | 0.1% |
| 47.54 | 5 | 0.1% |
| Other values (2797) | 4344 |
| Value | Count | Frequency (%) |
| 47.1647 | 1 | |
| 47.1775 | 1 | |
| 47.1776 | 1 | |
| 47.1795 | 1 | |
| 47.1913 | 1 | |
| 47.1937 | 1 | |
| 47.1938 | 1 | |
| 47.1941 | 1 | |
| 47.1943 | 1 | |
| 47.1955 | 1 |
| Value | Count | Frequency (%) |
| 47776 | 1 | < 0.1% |
| 47774 | 4 | |
| 47772 | 2 | |
| 47771 | 1 | < 0.1% |
| 47762 | 1 | < 0.1% |
| 47759 | 1 | < 0.1% |
| 47757 | 1 | < 0.1% |
| 47754 | 2 | |
| 47753 | 1 | < 0.1% |
| 47752 | 2 |
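Most lat values lie near 47 (median 47.6057), yet the 95-th percentile is 47548.3 and the maximum is 47776, far outside the valid latitude range of −90 to 90. One possible explanation, which is an assumption and not something the report states, is that some rows lost their decimal separator (for example 47.681 stored as 47681). The sketch below flags such values and rescales them. The function name, the limit of 90 and the factor of 1000 are all hypothetical choices for illustration.

```python
def rescale_if_out_of_range(value, limit=90.0, factor=1000.0):
    """Hypothetical repair: divide by `factor` when |value| exceeds `limit`."""
    if abs(value) > limit:
        return value / factor
    return value


# Sample values copied from the frequency tables above.
samples = [47.6624, 47681, 47.1647, 47776]
repaired = [rescale_if_out_of_range(v) for v in samples]

# After the repair every sample falls inside the valid latitude range.
assert all(abs(v) <= 90 for v in repaired)
print(repaired)
```

Whether rescaling is appropriate depends on how the data was produced; checking a few affected rows against the source before applying it would be prudent.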
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 257.9 KiB |
| Value | Count |
|---|---|
| 0.0 | 4364 |
| 1.0 | 35 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 13197 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 4364 | |
| 1.0 | 35 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 8763 | |
| . | 4399 | |
| 1 | 35 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8798 | |
| Other Punctuation | 4399 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8763 | |
| 1 | 35 | 0.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4399 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 13197 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 8763 | |
| . | 4399 | |
| 1 | 35 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13197 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 8763 | |
| . | 4399 | |
| 1 | 35 | 0.3% |
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.484762338 |
| Minimum | 1 |
|---|---|
| Maximum | 3.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 3.5 |
| Range | 2.5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.5390735904 |
|---|---|
| Coefficient of variation (CV) | 0.3630706253 |
| Kurtosis | -0.4506863326 |
| Mean | 1.484762338 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.6499864076 |
| Sum | 6528.5 |
| Variance | 0.2906003359 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 2212 | |
| 2 | 1637 | |
| 1.5 | 391 | 8.9% |
| 3 | 125 | 2.8% |
| 2.5 | 31 | 0.7% |
| 3.5 | 1 | < 0.1% |
| (Missing) | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 2212 | |
| 1.5 | 391 | 8.9% |
| 2 | 1637 | |
| 2.5 | 31 | 0.7% |
| 3 | 125 | 2.8% |
| 3.5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 3.5 | 1 | < 0.1% |
| 3 | 125 | 2.8% |
| 2.5 | 31 | 0.7% |
| 2 | 1637 | |
| 1.5 | 391 | 8.9% |
| 1 | 2212 |
yr_renovated
| Distinct | 52 |
|---|---|
| Distinct (%) | 24.5% |
| Missing | 4187 |
| Missing (%) | 95.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1995.830189 |
| Minimum | 1940 |
|---|---|
| Maximum | 2015 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.5 KiB |
Quantile statistics
| Minimum | 1940 |
|---|---|
| 5-th percentile | 1966.65 |
| Q1 | 1987 |
| median | 1998 |
| Q3 | 2007 |
| 95-th percentile | 2014 |
| Maximum | 2015 |
| Range | 75 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 15.06162488 |
|---|---|
| Coefficient of variation (CV) | 0.007546546277 |
| Kurtosis | 1.266502905 |
| Mean | 1995.830189 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | -1.076228714 |
| Sum | 423116 |
| Variance | 226.852544 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2014 | 15 | 0.3% |
| 2007 | 10 | 0.2% |
| 2013 | 10 | 0.2% |
| 2004 | 9 | 0.2% |
| 2002 | 9 | 0.2% |
| 1998 | 8 | 0.2% |
| 1991 | 8 | 0.2% |
| 2010 | 8 | 0.2% |
| 2005 | 8 | 0.2% |
| 2000 | 7 | 0.2% |
| Other values (42) | 120 | 2.7% |
| (Missing) | 4187 |
| Value | Count | Frequency (%) |
| 1940 | 1 | < 0.1% |
| 1945 | 1 | < 0.1% |
| 1950 | 2 | |
| 1955 | 1 | < 0.1% |
| 1957 | 1 | < 0.1% |
| 1958 | 2 | |
| 1965 | 3 | |
| 1968 | 1 | < 0.1% |
| 1970 | 2 | |
| 1972 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2015 | 4 | 0.1% |
| 2014 | 15 | |
| 2013 | 10 | |
| 2012 | 2 | < 0.1% |
| 2011 | 5 | 0.1% |
| 2010 | 8 | |
| 2009 | 3 | 0.1% |
| 2008 | 1 | < 0.1% |
| 2007 | 10 | |
| 2006 | 5 | 0.1% |
| Distinct | 116 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1970.903842 |
| Minimum | 1900 |
|---|---|
| Maximum | 2015 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.5 KiB |
Quantile statistics
| Minimum | 1900 |
|---|---|
| 5-th percentile | 1916 |
| Q1 | 1951 |
| median | 1974 |
| Q3 | 1996 |
| 95-th percentile | 2011 |
| Maximum | 2015 |
| Range | 115 |
| Interquartile range (IQR) | 45 |
Descriptive statistics
| Standard deviation | 29.16811726 |
|---|---|
| Coefficient of variation (CV) | 0.0147993609 |
| Kurtosis | -0.6648264868 |
| Mean | 1970.903842 |
| Median Absolute Deviation (MAD) | 22 |
| Skewness | -0.4507994939 |
| Sum | 8670006 |
| Variance | 850.7790644 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2014 | 122 | 2.8% |
| 2005 | 92 | 2.1% |
| 2004 | 91 | 2.1% |
| 1977 | 87 | 2.0% |
| 1968 | 84 | 1.9% |
| 1978 | 84 | 1.9% |
| 2006 | 83 | 1.9% |
| 2007 | 79 | 1.8% |
| 2008 | 79 | 1.8% |
| 1962 | 77 | 1.8% |
| Other values (106) | 3521 |
| Value | Count | Frequency (%) |
| 1900 | 12 | |
| 1901 | 7 | 0.2% |
| 1902 | 3 | 0.1% |
| 1903 | 10 | |
| 1904 | 11 | |
| 1905 | 12 | |
| 1906 | 16 | |
| 1907 | 11 | |
| 1908 | 22 | |
| 1909 | 22 |
| Value | Count | Frequency (%) |
| 2015 | 10 | 0.2% |
| 2014 | 122 | |
| 2013 | 40 | 0.9% |
| 2012 | 37 | 0.8% |
| 2011 | 15 | 0.3% |
| 2010 | 24 | 0.5% |
| 2009 | 48 | 1.1% |
| 2008 | 79 | |
| 2007 | 79 | |
| 2006 | 83 |
long
Real number (ℝ)
| Distinct | 575 |
|---|---|
| Distinct (%) | 13.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -110838.0874 |
| Minimum | -122505 |
|---|---|
| Maximum | -121.76 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 4399 |
| Negative (%) | 100.0% |
| Memory size | 34.5 KiB |
Quantile statistics
| Minimum | -122505 |
|---|---|
| 5-th percentile | -122386 |
| Q1 | -122321 |
| median | -122212 |
| Q3 | -122082.5 |
| 95-th percentile | -122.279 |
| Maximum | -121.76 |
| Range | 122383.24 |
| Interquartile range (IQR) | 238.5 |
Descriptive statistics
| Standard deviation | 35499.48961 |
|---|---|
| Coefficient of variation (CV) | -0.3202824087 |
| Kurtosis | 5.839820182 |
| Mean | -110838.0874 |
| Median Absolute Deviation (MAD) | 113 |
| Skewness | 2.799465297 |
| Sum | -487576746.5 |
| Variance | 1260213763 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -122362 | 27 | 0.6% |
| -122291 | 25 | 0.6% |
| -122301 | 24 | 0.5% |
| -122352 | 23 | 0.5% |
| -122311 | 23 | 0.5% |
| -122361 | 23 | 0.5% |
| -122298 | 22 | 0.5% |
| -122391 | 22 | 0.5% |
| -122288 | 22 | 0.5% |
| -122354 | 22 | 0.5% |
| Other values (565) | 4166 |
| Value | Count | Frequency (%) |
| -122505 | 2 | |
| -122482 | 1 | < 0.1% |
| -122479 | 1 | < 0.1% |
| -122473 | 1 | < 0.1% |
| -122472 | 1 | < 0.1% |
| -122463 | 1 | < 0.1% |
| -122462 | 3 | |
| -122456 | 1 | < 0.1% |
| -122455 | 1 | < 0.1% |
| -122452 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| -121.76 | 1 | < 0.1% |
| -121.77 | 1 | < 0.1% |
| -121.78 | 1 | < 0.1% |
| -121.86 | 2 | |
| -121.87 | 1 | < 0.1% |
| -121.88 | 2 | |
| -121.89 | 4 | |
| -121.9 | 1 | < 0.1% |
| -121.91 | 1 | < 0.1% |
| -121.92 | 2 |
jhygtf
| Distinct | 53 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 3 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 96.25022748 |
| Minimum | 0 |
|---|---|
| Maximum | 2015 |
| Zeros | 4184 |
| Zeros (%) | 95.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 2015 |
| Range | 2015 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 427.6534754 |
|---|---|
| Coefficient of variation (CV) | 4.44314249 |
| Kurtosis | 15.81063447 |
| Mean | 96.25022748 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.219261092 |
| Sum | 423116 |
| Variance | 182887.4951 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4184 | |
| 2014 | 15 | 0.3% |
| 2013 | 10 | 0.2% |
| 2007 | 10 | 0.2% |
| 2002 | 9 | 0.2% |
| 2004 | 9 | 0.2% |
| 2005 | 8 | 0.2% |
| 2010 | 8 | 0.2% |
| 1998 | 8 | 0.2% |
| 1991 | 8 | 0.2% |
| Other values (43) | 127 | 2.9% |
| Value | Count | Frequency (%) |
| 0 | 4184 | |
| 1940 | 1 | < 0.1% |
| 1945 | 1 | < 0.1% |
| 1950 | 2 | < 0.1% |
| 1955 | 1 | < 0.1% |
| 1957 | 1 | < 0.1% |
| 1958 | 2 | < 0.1% |
| 1965 | 3 | 0.1% |
| 1968 | 1 | < 0.1% |
| 1970 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 2015 | 4 | 0.1% |
| 2014 | 15 | |
| 2013 | 10 | |
| 2012 | 2 | < 0.1% |
| 2011 | 5 | 0.1% |
| 2010 | 8 | |
| 2009 | 3 | 0.1% |
| 2008 | 1 | < 0.1% |
| 2007 | 10 | |
| 2006 | 5 | 0.1% |
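The statistics above (4184 zeros, matching the jhygtf alert) and those of the block with 4187 missing values (matching the yr_renovated alert) report the same Sum, 423116. That is consistent with one column being the other with zeros in place of missing values, although the report itself only states that the two are highly correlated. The sketch below checks the arithmetic behind this reading. The variable names are illustrative.

```python
import math

n_rows = 4399
total = 423116  # "Sum" reported in both blocks

# yr_renovated: 4187 missing values, so statistics use the remaining rows.
yr_renovated_present = n_rows - 4187

# jhygtf: 3 missing values and 4184 zeros.
jhygtf_present = n_rows - 3
jhygtf_nonzero = jhygtf_present - 4184

# Both columns carry the same number of informative (non-zero) entries.
assert yr_renovated_present == jhygtf_nonzero == 212

# The same sum over different denominators explains the two reported means.
mean_excluding_missing = total / yr_renovated_present  # reported: 1995.830189
mean_including_zeros = total / jhygtf_present          # reported: 96.25022748

assert math.isclose(mean_excluding_missing, 1995.830189, rel_tol=1e-8)
assert math.isclose(mean_including_zeros, 96.25022748, rel_tol=1e-8)
```

The zero-filled encoding pulls the mean down to about 96 and every quantile up to the 95-th percentile to 0, so the statistics computed over non-missing rows are the more useful description of renovation years.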
| Distinct | 2950 |
|---|---|
| Distinct (%) | 67.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14267.01819 |
| Minimum | 609 |
|---|---|
| Maximum | 1074218 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.5 KiB |
Quantile statistics
| Minimum | 609 |
|---|---|
| 5-th percentile | 1881.4 |
| Q1 | 5082 |
| median | 7684.5 |
| Q3 | 10693 |
| 95-th percentile | 40444.35 |
| Maximum | 1074218 |
| Range | 1073609 |
| Interquartile range (IQR) | 5611 |
Descriptive statistics
| Standard deviation | 36474.92835 |
|---|---|
| Coefficient of variation (CV) | 2.556590863 |
| Kurtosis | 268.1891864 |
| Mean | 14267.01819 |
| Median Absolute Deviation (MAD) | 2684.5 |
| Skewness | 12.94256235 |
| Sum | 62746346 |
| Variance | 1330420398 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6000 | 63 | 1.4% |
| 5000 | 63 | 1.4% |
| 4000 | 48 | 1.1% |
| 7200 | 42 | 1.0% |
| 4800 | 38 | 0.9% |
| 4500 | 24 | 0.5% |
| 7500 | 23 | 0.5% |
| 8400 | 21 | 0.5% |
| 9000 | 21 | 0.5% |
| 9600 | 21 | 0.5% |
| Other values (2940) | 4034 |
| Value | Count | Frequency (%) |
| 609 | 1 | |
| 635 | 1 | |
| 638 | 1 | |
| 700 | 1 | |
| 711 | 1 | |
| 747 | 2 | |
| 762 | 1 | |
| 780 | 1 | |
| 812 | 1 | |
| 819 | 1 |
| Value | Count | Frequency (%) |
| 1074218 | 1 | |
| 871200 | 1 | |
| 507038 | 1 | |
| 493534 | 1 | |
| 453895 | 1 | |
| 432036 | 1 | |
| 423838 | 1 | |
| 329903 | 1 | |
| 286355 | 1 | |
| 280962 | 1 |
| Distinct | 1554 |
|---|---|
| Distinct (%) | 35.4% |
| Missing | 3 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40178389.43 |
| Minimum | 89950 |
|---|---|
| Maximum | 3635000000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.5 KiB |
Quantile statistics
| Minimum | 89950 |
|---|---|
| 5-th percentile | 210000 |
| Q1 | 319950 |
| median | 450000 |
| Q3 | 650000 |
| 95-th percentile | 1250000 |
| Maximum | 3635000000 |
| Range | 3634910050 |
| Interquartile range (IQR) | 330050 |
Descriptive statistics
| Standard deviation | 245974676.2 |
|---|---|
| Coefficient of variation (CV) | 6.122064118 |
| Kurtosis | 50.09732109 |
| Mean | 40178389.43 |
| Median Absolute Deviation (MAD) | 153000 |
| Skewness | 6.748377121 |
| Sum | 1.766241999 × 10¹¹ |
| Variance | 6.050354135 × 10¹⁶ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 350000 | 43 | 1.0% |
| 450000 | 41 | 0.9% |
| 375000 | 36 | 0.8% |
| 325000 | 35 | 0.8% |
| 525000 | 34 | 0.8% |
| 400000 | 31 | 0.7% |
| 500000 | 30 | 0.7% |
| 250000 | 30 | 0.7% |
| 440000 | 29 | 0.7% |
| 475000 | 28 | 0.6% |
| Other values (1544) | 4059 |
| Value | Count | Frequency (%) |
| 89950 | 1 | |
| 105500 | 1 | |
| 106000 | 1 | |
| 110000 | 2 | |
| 111300 | 1 | |
| 114000 | 1 | |
| 115000 | 1 | |
| 121800 | 1 | |
| 122000 | 1 | |
| 123000 | 1 |
| Value | Count | Frequency (%) |
| 3635000000 | 1 | |
| 2546000000 | 1 | |
| 2538000000 | 1 | |
| 2532000000 | 1 | |
| 2458000000 | 1 | |
| 2415000000 | 1 | |
| 2367000000 | 1 | |
| 2288000000 | 1 | |
| 2193000000 | 1 | |
| 2125000000 | 1 |
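The largest values above are three to four orders of magnitude beyond the quartiles (Q3 is 650000, while the maximum is 3635000000), which is why the mean of 40178389.43 sits far above the 95-th percentile of 1250000. One common way to flag such values is Tukey's 1.5 × IQR rule. This is a technique brought in here for illustration, not something pandas-profiling applies in this report.

```python
# Quartiles copied from the quantile statistics above.
q1, q3 = 319950, 650000
iqr = q3 - q1  # 330050, as reported

# Tukey fences: values outside [lower_fence, upper_fence] are flagged.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

# The ten largest values listed in the table above.
largest = [
    3635000000, 2546000000, 2538000000, 2532000000, 2458000000,
    2415000000, 2367000000, 2288000000, 2193000000, 2125000000,
]
flagged = [v for v in largest if v > upper_fence]

print(upper_fence)
print(len(flagged), "of", len(largest), "largest values exceed the upper fence")
```

The reported 95-th percentile also lies above the upper fence, so this rule would flag more than 5% of the rows. A domain-specific threshold may be a better fit if only the extreme values are of interest.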
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 257.9 KiB |
| Value | Count |
|---|---|
| 3.0 | 2893 |
| 4.0 | 1116 |
| 5.0 | 348 |
| 2.0 | 35 |
| 1.0 | 7 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 13197 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3.0 |
|---|---|
| 2nd row | 3.0 |
| 3rd row | 3.0 |
| 4th row | 3.0 |
| 5th row | 3.0 |
Common Values
| Value | Count | Frequency (%) |
| 3.0 | 2893 | |
| 4.0 | 1116 | 25.4% |
| 5.0 | 348 | 7.9% |
| 2.0 | 35 | 0.8% |
| 1.0 | 7 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 4399 | |
| 0 | 4399 | |
| 3 | 2893 | |
| 4 | 1116 | 8.5% |
| 5 | 348 | 2.6% |
| 2 | 35 | 0.3% |
| 1 | 7 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8798 | |
| Other Punctuation | 4399 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4399 | |
| 3 | 2893 | |
| 4 | 1116 | 12.7% |
| 5 | 348 | 4.0% |
| 2 | 35 | 0.4% |
| 1 | 7 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4399 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 13197 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 4399 | |
| 0 | 4399 | |
| 3 | 2893 | |
| 4 | 1116 | 8.5% |
| 5 | 348 | 2.6% |
| 2 | 35 | 0.3% |
| 1 | 7 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13197 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 4399 | 33.3% |
| 0 | 4399 | 33.3% |
| 3 | 2893 | 21.9% |
| 4 | 1116 | 8.5% |
| 5 | 348 | 2.6% |
| 2 | 35 | 0.3% |
| 1 | 7 | 0.1% |
| Distinct | 2791 |
|---|---|
| Distinct (%) | 63.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12271.97977 |
| Minimum | 748 |
|---|---|
| Maximum | 871200 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.5 KiB |
Quantile statistics
| Minimum | 748 |
|---|---|
| 5-th percentile | 2138.8 |
| Q1 | 5110 |
| median | 7688 |
| Q3 | 10077.5 |
| 95-th percentile | 35900.6 |
| Maximum | 871200 |
| Range | 870452 |
| Interquartile range (IQR) | 4967.5 |
Descriptive statistics
| Standard deviation | 26827.1341 |
|---|---|
| Coefficient of variation (CV) | 2.186047778 |
| Kurtosis | 304.5097131 |
| Mean | 12271.97977 |
| Median Absolute Deviation (MAD) | 2488 |
| Skewness | 13.29356308 |
| Sum | 53984439 |
| Variance | 719695124.1 |
| Monotonicity | Not monotonic |
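The derived figures in the two tables above follow from the basic ones by simple arithmetic. A minimal sketch, assuming the mean is taken over the 4399 rows of the dataset (this variable has no missing values), re-derives them from the reported numbers:

```python
# Figures copied from the quantile and descriptive statistics tables above.
minimum, maximum = 748, 871200
q1, q3 = 5110, 10077.5
mean, std = 12271.97977, 26827.1341
total, n = 53984439, 4399  # n is the dataset's row count (assumed denominator)

value_range = maximum - minimum  # Range = Maximum - Minimum
iqr = q3 - q1                    # Interquartile range = Q3 - Q1
cv = std / mean                  # Coefficient of variation = std / mean
variance = std ** 2              # Variance = std squared
mean_from_sum = total / n        # Mean = Sum / number of observations

print(value_range, iqr, round(cv, 6), round(mean_from_sum, 5))
```

A coefficient of variation above 2, together with the large skewness and kurtosis, indicates a heavily right-skewed distribution, so the median and IQR describe a typical value better than the mean and standard deviation do.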
| Value | Count | Frequency (%) |
| 5000 | 87 | 2.0% |
| 4000 | 71 | 1.6% |
| 6000 | 67 | 1.5% |
| 7200 | 33 | 0.8% |
| 4800 | 32 | 0.7% |
| 8000 | 27 | 0.6% |
| 7500 | 23 | 0.5% |
| 4500 | 23 | 0.5% |
| 9000 | 21 | 0.5% |
| 4080 | 21 | 0.5% |
| Other values (2781) | 3994 | 90.8% |
| Value | Count | Frequency (%) |
| 748 | 1 | < 0.1% |
| 817 | 1 | < 0.1% |
| 824 | 1 | < 0.1% |
| 899 | 1 | < 0.1% |
| 925 | 1 | < 0.1% |
| 928 | 1 | < 0.1% |
| 942 | 3 | 0.1% |
| 953 | 1 | < 0.1% |
| 967 | 1 | < 0.1% |
| 976 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 871200 | 1 | < 0.1% |
| 560617 | 1 | < 0.1% |
| 325393 | 1 | < 0.1% |
| 263492 | 1 | < 0.1% |
| 230868 | 1 | < 0.1% |
| 229125 | 1 | < 0.1% |
| 223463 | 1 | < 0.1% |
| 222591 | 1 | < 0.1% |
| 220232 | 1 | < 0.1% |
| 219542 | 2 | < 0.1% |
| Distinct | 539 |
|---|---|
| Distinct (%) | 12.3% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2066.738972 |
| Minimum | 420 |
|---|---|
| Maximum | 12050 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 34.5 KiB |
Quantile statistics
| Minimum | 420 |
|---|---|
| 5-th percentile | 940 |
| Q1 | 1420 |
| median | 1900 |
| Q3 | 2550 |
| 95-th percentile | 3750 |
| Maximum | 12050 |
| Range | 11630 |
| Interquartile range (IQR) | 1130 |
Descriptive statistics
| Standard deviation | 897.1812225 |
|---|---|
| Coefficient of variation (CV) | 0.4341047586 |
| Kurtosis | 5.320023485 |
| Mean | 2066.738972 |
| Median Absolute Deviation (MAD) | 550 |
| Skewness | 1.384835176 |
| Sum | 9089518 |
| Variance | 804934.1461 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1900 | 37 | 0.8% |
| 1820 | 36 | 0.8% |
| 1440 | 35 | 0.8% |
| 1660 | 34 | 0.8% |
| 1300 | 33 | 0.8% |
| 1280 | 33 | 0.8% |
| 1540 | 31 | 0.7% |
| 1720 | 30 | 0.7% |
| 1740 | 30 | 0.7% |
| 1940 | 29 | 0.7% |
| Other values (529) | 4070 | 92.5% |
| Value | Count | Frequency (%) |
| 420 | 1 | < 0.1% |
| 470 | 1 | < 0.1% |
| 520 | 1 | < 0.1% |
| 530 | 1 | < 0.1% |
| 540 | 1 | < 0.1% |
| 560 | 2 | < 0.1% |
| 570 | 1 | < 0.1% |
| 590 | 2 | < 0.1% |
| 600 | 2 | < 0.1% |
| 630 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 12050 | 1 | < 0.1% |
| 7880 | 1 | < 0.1% |
| 7710 | 1 | < 0.1% |
| 7050 | 1 | < 0.1% |
| 6510 | 1 | < 0.1% |
| 6500 | 1 | < 0.1% |
| 6200 | 1 | < 0.1% |
| 5990 | 1 | < 0.1% |
| 5860 | 1 | < 0.1% |
| 5830 | 1 | < 0.1% |
date
Date
| Distinct | 329 |
|---|---|
| Distinct (%) | 7.5% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 34.5 KiB |
| Minimum | 2014-05-02 00:00:00 |
|---|---|
| Maximum | 2015-05-24 00:00:00 |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) measures monotonic correlation between two variables and is therefore better than Pearson's r at capturing nonlinear monotonic relationships. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
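The rank-based definition can be sketched in a few lines of plain Python. This is an illustrative implementation, not the code the report generator uses; the helper names `ranks`, `pearson`, and `spearman` and the sample data are hypothetical.

```python
from statistics import mean


def ranks(values):
    """Return 1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = avg_rank
        i = j + 1
    return result


def pearson(x, y):
    """Covariance divided by the product of the standard deviations."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman's rho: Pearson's r computed on the ranks of x and y."""
    return pearson(ranks(x), ranks(y))


# Hypothetical data: y = x**3 is monotonic in x but not linear.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
rho = spearman(x, y)
r = pearson(x, y)
print(f"rho = {rho:.3f}, r = {r:.3f}")
```

On this sample ρ reaches 1 because the ordering of y matches the ordering of x exactly, while r stays below 1 because the relationship is not a straight line.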
Pearson's r
Pearson's correlation coefficient (r) measures linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that, for a linear relationship, the slope of the line (its angle to the x-axis) affects only the sign of r, not its magnitude. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
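The invariance property can be checked numerically. The following is a minimal sketch with hypothetical sample values and a hypothetical helper `pearson_r`; it shifts and rescales each variable separately (with positive scale factors) and compares the two coefficients.

```python
from statistics import mean


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y)) / n
    sd_x = (sum((a - mx) ** 2 for a in x) / n) ** 0.5
    sd_y = (sum((b - my) ** 2 for b in y) / n) ** 0.5
    return cov / (sd_x * sd_y)


# Hypothetical sample values, not taken from the report.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]

r_original = pearson_r(x, y)

# Change location and scale of each variable independently.
x_rescaled = [10 * a + 3 for a in x]
y_rescaled = [0.5 * b - 7 for b in y]
r_rescaled = pearson_r(x_rescaled, y_rescaled)

print(f"{r_original:.6f} {r_rescaled:.6f}")
```

The two printed values agree up to floating-point error. Multiplying one variable by a negative factor would flip the sign of r while leaving its magnitude unchanged.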
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is then the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
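The pair-counting definition translates directly into code. The sketch below implements the plain form described in the text (often called tau-a); the function name `kendall_tau_a` and the sample data are hypothetical, and statistical libraries commonly default to a tie-adjusted variant, so results can differ when the data contain ties.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1  # both variables move in the same direction
        elif direction < 0:
            discordant += 1  # the variables move in opposite directions
        # Pairs tied in x or y count as neither concordant nor discordant.
    n = len(x)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


# Hypothetical data: 7 concordant and 3 discordant pairs out of 10.
x = [1, 2, 3, 4, 5]
y = [3, 1, 4, 2, 5]
tau = kendall_tau_a(x, y)
print(tau)
```

For this sample τ = (7 - 3) / 10 = 0.4.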
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available.
First rows
| | df_index | zipcode | grade | sqft_basement | view | bathrooms | bedrooms | sqft_above | sqft_living15 | lat | waterfront | floors | yr_renovated | yr_built | long | jhygtf | sqft_lot | price | condition | sqft_lot15 | sqft_living | date |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 32828 | 98058.0 | 8.0 | NaN | 0.0 | 2.25 | 3.0 | 1780.0 | 2080.0 | 47.4539 | 0.0 | 1.0 | NaN | 1967.0 | -122.15 | 0.0 | 10395.0 | 300000.0 | 3.0 | 9360.0 | 1780.0 | 2014-06-26 |
| 1 | 6012 | 98117.0 | 8.0 | NaN | 0.0 | 2.50 | 3.0 | 1350.0 | 1350.0 | 47.6758 | 0.0 | 3.0 | NaN | 2005.0 | -122386.00 | 0.0 | 2053.0 | 477000.0 | 3.0 | 4150.0 | 1350.0 | 2014-05-19 |
| 2 | 34920 | 98028.0 | 8.0 | NaN | 0.0 | 3.00 | 4.0 | 2450.0 | 2460.0 | 47.7721 | 0.0 | 2.0 | NaN | 2004.0 | -122235.00 | 0.0 | 4668.0 | 500000.0 | 3.0 | 4895.0 | 2450.0 | 2015-04-06 |
| 3 | 923 | 98133.0 | 7.0 | NaN | 0.0 | 1.75 | 3.0 | 1420.0 | 1740.0 | 47.7535 | 0.0 | 1.0 | NaN | 1954.0 | -122354.00 | 0.0 | 8250.0 | 265000.0 | 3.0 | 8000.0 | 1420.0 | 2014-08-26 |
| 4 | 9859 | 98055.0 | 7.0 | 240.0 | 0.0 | 1.00 | 2.0 | 1180.0 | 1490.0 | 47.4342 | 0.0 | 1.0 | NaN | 1956.0 | -122195.00 | 0.0 | 81892.0 | 360000.0 | 3.0 | 1863.0 | 1420.0 | 2014-05-09 |
| 5 | 13488 | 98052.0 | 9.0 | NaN | 0.0 | 2.50 | 4.0 | 2110.0 | 2180.0 | 47.6374 | 0.0 | 2.0 | NaN | 1999.0 | -122111.00 | 0.0 | 6069.0 | 689000.0 | 3.0 | 9000.0 | 2110.0 | 2015-03-25 |
| 6 | 15973 | 98118.0 | 7.0 | 1100.0 | 0.0 | 1.75 | 4.0 | 1100.0 | 1600.0 | 47543.0000 | 0.0 | 1.0 | NaN | 1955.0 | -122.28 | 0.0 | 7475.0 | 375000.0 | 5.0 | 5766.0 | 2200.0 | 2014-06-04 |
| 7 | 24265 | 98052.0 | 10.0 | NaN | 0.0 | 3.00 | 3.0 | 2960.0 | 2640.0 | 47.7183 | 0.0 | 2.0 | NaN | 1995.0 | -122.10 | 0.0 | 42159.0 | 849000.0 | 3.0 | 25209.0 | 2960.0 | 2014-07-29 |
| 8 | 29055 | 98003.0 | 7.0 | NaN | 0.0 | 1.75 | 3.0 | 1320.0 | 1550.0 | 47.3257 | 0.0 | 1.0 | NaN | 1956.0 | -122296.00 | 0.0 | 17390.0 | 199000.0 | 4.0 | 19265.0 | 1320.0 | 2014-08-13 |
| 9 | 2550 | 98004.0 | 11.0 | NaN | 0.0 | 3.50 | 4.0 | 4280.0 | 2360.0 | 47.5979 | 0.0 | 2.0 | NaN | 2005.0 | -122197.00 | 0.0 | 9583.0 | 1600000.0 | 3.0 | 10031.0 | 4280.0 | 2014-06-06 |
Last rows
| | df_index | zipcode | grade | sqft_basement | view | bathrooms | bedrooms | sqft_above | sqft_living15 | lat | waterfront | floors | yr_renovated | yr_built | long | jhygtf | sqft_lot | price | condition | sqft_lot15 | sqft_living | date |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4389 | 48630 | 98034.0 | 7.0 | NaN | 0.0 | 2.00 | 4.0 | 2100.0 | 1720.0 | 47.7239 | 0.0 | 1.0 | NaN | 1972.0 | -122173.0 | 0.0 | 12620.0 | 5.000000e+05 | 4.0 | 7840.0 | 2100.0 | 2014-08-27 |
| 4390 | 144 | 98022.0 | 7.0 | 1580.0 | 0.0 | 1.75 | 4.0 | 2150.0 | 1880.0 | 47.1775 | 0.0 | 1.0 | NaN | 1974.0 | -122022.0 | 0.0 | 16980.0 | 3.600000e+05 | 4.0 | 16963.0 | NaN | 2014-08-25 |
| 4391 | 59966 | 98116.0 | 7.0 | 800.0 | 0.0 | 1.00 | 3.0 | 1560.0 | 1690.0 | 47.5705 | 0.0 | 1.0 | NaN | 1964.0 | -122384.0 | 0.0 | 5012.0 | 5.320000e+05 | 3.0 | 4800.0 | 2360.0 | 2015-04-16 |
| 4392 | 38623 | 98005.0 | 11.0 | NaN | 0.0 | 2.75 | 4.0 | 3200.0 | 4050.0 | 47.6402 | 0.0 | 2.0 | NaN | 1984.0 | -122171.0 | 0.0 | 13729.0 | 1.272000e+09 | 3.0 | 16921.0 | 3200.0 | 2014-09-29 |
| 4393 | 9343 | 98115.0 | 8.0 | NaN | 0.0 | 2.50 | 3.0 | 2060.0 | 1240.0 | 47.6961 | 0.0 | 2.0 | 2006.0 | 1924.0 | -122316.0 | 2006.0 | 9715.0 | 8.400000e+05 | 3.0 | 7072.0 | 2060.0 | 2014-08-20 |
| 4394 | 14375 | 98038.0 | 7.0 | NaN | 0.0 | 2.00 | 3.0 | 1220.0 | 1570.0 | 47.3523 | 0.0 | 1.0 | NaN | 1994.0 | -122059.0 | 0.0 | 6404.0 | 2.499000e+05 | 3.0 | 7000.0 | 1220.0 | 2014-07-17 |
| 4395 | 29775 | 98155.0 | 7.0 | 800.0 | 0.0 | 2.00 | 3.0 | 1140.0 | 1940.0 | 47774.0000 | 0.0 | 1.0 | NaN | 1978.0 | -122283.0 | 0.0 | 16300.0 | 3.990000e+05 | 3.0 | 11250.0 | 1940.0 | 2014-08-08 |
| 4396 | 5801 | 98022.0 | 7.0 | 1220.0 | 0.0 | 1.75 | 3.0 | 1390.0 | 2140.0 | 47.2585 | 0.0 | 1.0 | NaN | 1981.0 | -121925.0 | 0.0 | 117176.0 | 3.780000e+05 | 3.0 | 142005.0 | 2610.0 | 2014-10-20 |
| 4397 | 16610 | 98034.0 | 7.0 | NaN | 0.0 | 1.50 | 3.0 | 1340.0 | 1620.0 | 47.7168 | 0.0 | 1.0 | NaN | 1972.0 | -122192.0 | 0.0 | 6500.0 | 3.870000e+05 | 3.0 | 7107.0 | 1340.0 | 2014-10-27 |
| 4398 | 235 | 98117.0 | 6.0 | NaN | 0.0 | 1.00 | 2.0 | 910.0 | 1480.0 | 47683.0000 | 0.0 | 1.0 | NaN | 1914.0 | -122374.0 | 0.0 | 5000.0 | 4.500000e+05 | 4.0 | 5000.0 | 910.0 | 2014-08-26 |